Chat Models
| Organization | Model Name | API Model String | Context length | Quantization |
|---|---|---|---|---|
| Qwen | Qwen3 Coder 30B A3B Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | 131000 | FP16 |
| OpenAI | GPT OSS 120B | openai/gpt-oss-120b | 128000 | MXFP4 |
| OpenAI | GPT OSS 20B | openai/gpt-oss-20b | 128000 | MXFP4 |
| DeepSeek | DeepSeek R1 Distill Llama 70B | deepseek-ai/deepseek-r1-distill-llama-70b | 65000 | FP16 |
| Mistral AI | Mistral (7B) Instruct v0.3 | mistralai/Mistral-7B-Instruct-v0.3 | 32768 | FP16 |
Image Models
| Organization | Model Name | Model String for API | Model Type | Default steps |
|---|---|---|---|---|
| Qwen Tongyi MAI | Z Image Turbo | Tongyi-MAI/Z-Image-Turbo | Image Generation | 9 |
| Stability AI | Stable Diffusion 3.5 Large | stabilityai/stable-diffusion-3.5-large | Image Generation | 30 |
| Qwen | Qwen Image Edit | Qwen/Qwen-Image-Edit | Image Edit | 20 |
Vision models
| Organization | Model Name | API Model String | Context length |
|---|---|---|---|
| Qwen | Qwen3-VL 8B Instruct | Qwen/Qwen3-VL-8B-Instruct | 32768 |
| Qwen | Qwen2.5-VL 7B Instruct | Qwen/Qwen2.5-VL-7B-Instruct | 32768 |
Audio models
| Organization | Modality | Model Name | Model String for API |
|---|---|---|---|
| OpenAI | Speech-to-Text | Whisper Large v3 | openai/whisper-large-v3 |
Embedding models
| Model Name | Model String for API | Model Size | Embedding Dimension | Context Window |
|---|---|---|---|---|
| BGE-Large-EN-v1.5 | BAAI/bge-large-en-v1.5 | 326M | 1024 | 512 |